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Abstract. We investigate finite deterministic automata in sets with 
non-homogeneous atoms: integers with successor. As there are uncount- 
ably many deterministic finite automata in this setting, we restrict our 
attention to automata with semilinear transition function. The main re- 
sults is a minimization procedure for semilinear automata. The proof is 
subtle and refers to decidability of existential Presburger arithmetic with 
divisibility predicates. Interestingly, the minimization is not obtained by 
the standard partition refinement procedure, and we demonstrate that 
this procedure does not necessarily terminate for semilinear automata. 



1 Introduction 

This paper is a successor of a line of research aiming at studying models of 
computation in a new set theory, namy in sets with atoms (for motivation and 
a more detailed exposition see for instance [3, 5,4]). 

In set theory, elements of sets are other sets, organized in a well-founded way. 
In this paper we work in a new set theory, where one additionally postulates 
an infinite set of atoms. Then elements of sets are either other sets, or atoms, 
while atoms themselves have not elements. Examples of atoms that are often 
considered are equality atoms (N, =), i.e., natural numbers with equality, or 
total order atoms (Q, <), i.e., rational number with the natural order. In this 
paper we focus on integer atoms 

(Z,+l), 

i.e., the integers with the successor function. 

In general, atoms are an algebraic structure over some vocabulary. The struc- 
ture is typically assumed to be homogeneous [12], i.e., to satisfy the following 
condition: every isomorphism between finite substructures extends to an auto- 
morphism of the whole structure. Sets with atoms have particularly good proper- 
ties when atoms are a relational homogeneous structure over a finite vocabulary. 
Examples are the equality atoms or the total order atoms, but not the integer 
atoms. 1 

Sets with atoms were discovered in the 20ies by Fraenkel, and then investi- 
gated by Mostowski and others. In 90ies, sets with atoms have been rediscovered 



1 Integer atoms are homogeneous if one weakens the definition: only isomorphisms 
between fimtetely generated substructures extend to the whole structure. 



in semantics [10,9]. The paper [3] rediscovers sets with atoms in automata the- 
ory and observes that one can naturally study different models of computation 
in the new set theory. The principal difference is that the notion of finitcness 
in sets with atoms is replaced by a more liberal notion of orbit-finiteness. This 
approach allows to capture in an elegant syntax-independent way some models 
of automata, for instance finite memory automata of Francez and Kaminski [8] , 
or timed automata of Alur and Dill [2]. The paper [3] proves also the analog of 
Myhill-Nerode theorem for orbit-finite automata. The minimization is effective 
due to a powerful finite representation theorem provided in the paper. However, 
the representation theorem holds only when atoms are a homogeneous relational 
structure over a finite vocabulary. 

Contribution of the paper. The present paper is a natural attempt to extend 
the above-mentioned results to non-homogeneous atoms. We investigate sets with 
atoms (Z, +1) and orbit-finite automata therein. However, the finite representa- 
tion theorem of [3] is not applicable any more, and as one of the consequences the 
automaton model is far too powerful. In particular, there is uncountably many 
non-equivalent orbit-finite automata in sets with the integer atoms, thus the au- 
tomata may not be finitely presented as an input to a procedure. This indicates 
a necessity of a reasonable restriction of the power of automata. Our restriction 
applies to transition relation of an automaton and requires this relation to be 
scmilinear. 

It turns out the under the restriction, the deterministic orbit-finite automata 
with the integer atoms admit a minimization procedure. This is the main re- 
sult of this paper. The proof is a surprisingly subtle and complicated reduction 
to satisfiability of existential Presburger arithmetic with divisibility. The latter 
problem was shown decidable in [11]. 

Related research. The integer atoms exhibit significant similarity to the timed 
atoms, i.e., the structure 

(Q,<,+i). 2 

In [5] orbit-finite automata with the timed atoms have been shown to subsume 
timed automata [2]. Moreover, effective minimization has been shown for a sub- 
class of automata rich enough to subsume timed automata: automata with timed 
atoms, with transition relation definable in FO(Q, <,+l) without quantifiers. 
This amounts to a more severe restriction than the semilinear restriction con- 
sidered in this paper. In fact we suppose that techniques similar to those used 
in this paper would also apply to semilinear automata with timed atoms. 

Other approaches to minimization of timed automata are discussed in [1, 14, 
16,15]. 

An extension of the representation theorem to non-homogeneous atoms has 
been recently formulated proved in [13]. It applies both to the integer atoms and 
to the timed atoms. 



2 Instead of (Q, <, +1), equally-well one could consider atoms (E, <, +). 
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2 Preliminaries 

Sets with atoms. The set of atoms is assumed to come equipped with some 
algebraic structure, like the rationals Q with the natural order. The notion of 
sets with atoms makes sense for any algebraic structure of atoms. We fix in this 
section an arbitrary such structure, even if later on we will stick to the particular 
structure (Z, +1), the integers with successor. 

The cumulative hierarchy of sets is a sequence of sets indexed by ordinal 
ranks. At any ordinal, sets of this rank are arbitrary sets whose elements are 
sets of smaller rank, or atoms. In particular, sets of rank are all subsets of 
atoms. We restrict to only those sets in the cumulative hierarchy that are well- 
behaved in the sense described below. 

The intuitive idea is that the given algebraic structure is the only relevant 
structure of atoms, thus we work 'up to automorphism of atoms'. Formally, 
given an automorphism ir of atoms, i.e., a bijection that preserves the algebraic 
structure, 7r may be naturally lifted to any set in the cumulative hierarchy: apply 
7r to all elements, then to all elements of elements, etc. Application of it to a 
set X we denote by X ■ ir. A set S of atoms is said to support a set x if every 
automorphism ir being identity on S preserves x as well: X ■ ir = X . All atoms 
trivially support any set. As an example, consider the equality atoms and the 
set x of all atoms except 3 and 5, which is supported by {3, 5}. If we move to 
the total order atoms, the set {3, 5} supports also the open interval (3, 5). In the 
sequel we are interested in finitely-supported sets, i.e., those supported by some 
finite set of atoms. 

Now we are prepared to define well-behaved sets: a set in cumulative hi- 
erarchy is hereditarily finitely- supported if it is finitely-supported itself, all its 
elements are finitely-supported, etc. A hereditarily finitely-supported set that is 
itself supported by the empty set of atoms we call equivariant. A particular but 
important special case is that of equivariant function, i.e., a function / : X — > Y 
between two sets that commutes with any automorphism ir of atoms: 

f(x ■ ir) = f(x) ■ ir for all x e X and all automorphisms ir. 

The new set theory has a more liberal notion of finiteness than the usual one: 
orbit-finiteness. For a set S of atoms, the S-orbit of a set x is defined as follows: 

{x ■ ir : ir is an automorphism of atoms being identity on S}. 

If S is finite, the above set is called single- orbit. A set is orbit- finite if it is the 
union of finitely-many single-orbit sets. Note however that the precise number 
of orbits of a set may vary depending on the choice of S. 

Sets with atoms have particularly good properties if the atoms are a homo- 
geneous relational structure over a finite vocabulary [12]. Examples are equal- 
ity atoms, total order atoms, or the random graph considered as a relational 
structure. One of these good properties is preservation of orbit-finiteness by the 
Cartesian product. In consequence, an n-ary relation between orbit-finite sets is 
orbit-finite as well. 
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Sets with integer atoms. From now on we only consider integer atoms; the 
automorphism group contains all translations x \-> x + z, for some z e Z and 
is thus isomorphic to Z with addition. This choice of atoms leads to a slightly 
pathological set theory. As an example, observe that every set is supported by 
any singleton {z} C Z as the translation with a fixed point is necessarily the 
identity. Thus all sets are in the cumulative hierarchy are hereditarily finitely- 
supported. This is why we will mostly work with equivariant sets in the sequel. 

The structure (Z, +1) is not homogeneous as a relational structure; in fact 
there is no extension of the structure by a finite set of relations that would make 
the structure homogeneous. It is thus not surprising that integer atoms lead to 
a set theory that lacks certain important properties. One of the most important 
problems is that orbit-finitcness is not preserved by the Cartesian product. For 
instance, the atoms Z is a single-orbit set, but the product Z x Z has infinitely 
many orbits, namely for each k E Z, the diagonal {(i, i + k) : i 6 Z} is an orbit. 
One of the consequences is that there are uncountably many equivariant binary 
relations in Z. 

By an equivariant isomorphism we mean a bijection between two sets that 
commutes with automorphisms of atoms. For k > 0, let Z^ denote integers mod- 
ulo k, i.e., the set of equivalence classes of the congruence modulo k. In particular 
Zo is the same as Z up to equivariant isomorphism. From the representation the- 
orem of [13] it follows that single-orbit sets with integer atoms are particularly 
simple: 

Lemma 1. Up to equivariant isomorphism, every one-orbit set with integer 
atoms is Z fe for some k > 0. 

Automata with integer atoms. For any atoms, one can naturally define 
nondeterministic and deterministic finite automata. For instance, a NFA consists 
of: 

— an orbit-finite alphabet A, 

— an orbit-finite set Q of states, 

— a transition relation S C Q x A x Q, 

— subsets I, F C Q of initial and accepting states. 

All sets above are implicitly assumed to be hereditarily finitely-supported. A 
DFA is obtained by additionally restricting the set / to be a singleton and the 
relation S to be a function Q x A — > Q. Note that a NFA is equivariant iff all its 
components A, Q, I, F and 8 are so. 

For many choices of atoms, in particular for homogeneous atoms, the notion 
of automaton outlined above is very reasonable. For instance, if the alphabet 
is just the set of atoms, equivariant NFA with equality atoms are expressively 
equivalent to the finite memory automata of Francez and Kaminski [8] (called 
also register automata [6]). Similarly, in case of total order atoms, equivari- 
ant NFA correspond to a variant of finite memory automata with order testing 
(cf. e.g. [7]). 

Unfortunately, automata with integer atoms are far too powerful, as illus- 
trated in the example below. 
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Example 1. Suppose that K C Z is any set of integers, e.g. the positive integers 
that are prime numbers. Consider the following language 

di&(K) = {xi . . . x n e Z* : for alii G {2, . . . , n}, x { - x t -i G K}. 

We claim that this language is recognized by an equivariant DFA with integer 
atoms. The automaton has: an initial state e, a sink error state _L, as well as 
one state z for every z G Z. It is easy to see that there are three orbits. The 
transition function is 



S(e,y) = y 

I _L otherwise 

S(±,y) = ± 

Of course, this is not a true finite automaton, because it refers to the set K in its 
transition relation, and the set K could be anything, e.g. undccidable. Note that 
there are uncountably many sets K, and each one gives a different language. 

This is why starting from Section 3 on we restrict our attention to semilinear 
automata. 



3 Semilinear sets with integer atoms 

A problem with the integer atoms is that the Cartesian product does not preserve 
orbit-finiteness. To overcome this difficulty we widen our interest from orbit-finite 
sets to polynomial sets, defined below. 

Polynomial sets. For any atoms, one may use the name polynomial sets for 
the smallest class of sets that contains all equivariant orbit-finite sets and is 
closed under finite products and disjoint unions. Thus a polynomial set is a finite 
disjoint union of monomial sets, i.e., of finite products of equivariant one-orbit 
sets. 

Under the integer atoms, by Lemma 1 every monomial set is, up to equivari- 
ant isomorphism, of the form 

%ki x Z k2 x • • • x Z kn where fci, . . . , k n G N. (1) 

Without loss of generality we only consider monomials of the form Z fc or Z&: 

Lemma 2. Every polynomial set is isomorphic to a finite disjoint union of 
monomials of the form Z fe or Z k , for k G N. 

Proof. Using Lemma 1 and the following identities: 

— for k > 1 there is an equivariant isomorphism between Z^ x Z and the disjoint 
union of k copies of Z; 
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— for k,l > 1 there is an equivariant isomorphism between Z& x Z; and Z m , 
where m is the least common multiplicity of k and I. 

a 

Semilinear sets with integer atoms. As we have remarked above, the set 
Z 2 , when seen as a set with integer atoms, is isomorphic to a disjoint union of a 
countably infinite number of copies of Z. As a consequence, there are uncountably 
many equivariant subsets of Z 2 : just choose any subset of the infinitely many 
orbits. This means that there is no hope of algorithms working with arbitrary 
equivariant subsets of the monomial Z 2 . This motivates us to restrict to subsets 
of polynomial sets that are not just equivariant, but also semilinear, as defined 
below. 

The standard notion of semilinear sets applies to subsets of monomials of 
the form Z fc . For monomials of the form every subset is considered to be 
semilinear. Then we extend definition to all monomials of the form: 

Zfcj x Zfe 2 x • • • x Zk n where fci, . . . , k n £ N; 

a subset of such set is semilinear if it is semilinear when translated along the 
isomorphism used in the proof of Lemma 2. The property does not depend on 
the choice of an isomorphism. 

Definition 1 (Semilinear sets with integer atoms). Consider a subset 

R C Xx x . . . x X n 

of an arbitrary equivariant monomial set. R is semilinear if for some n-tuple of 
equivariant isomorphisms 

(fi : Xi -> Zfc 4 ) i= i... n , where ki, . . . , fc„ £ N, 

the image of R along the isomorphisms is a semilinear subset of the set 

Z fel x Z fe2 x • • • x Z fe „ 

in the sense described above. 

Finally, a semilinear subset of a polynomial set is defined by choosing a semi- 
linear subset of each of its monomials. 

Note certain delicacy of the above definition. Observe that Z 3 , when interpreted 
as a set with integer atoms, is, similarly like Z 2 , isomorphic to a countable disjoint 
union of copies of Z. Therefore there is an equivariant isomorphism between Z 2 
and Z 3 ; consider for instance any bijection between orbits. However, a semilinear 
subset of Z 2 , translated to Z 3 via an equivariant isomorphism, is not a semilinear 
subset of Z 3 in general. 

Definition f immediately yields the notion of a semilinear n-ary relation on 
orbit-finite sets. Also as a special case, we get the notion of a semilinear function 
between polynomial sets. 
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We are interested in subsets of polynomial sets (which covers the case of 
relations and functions) which are both semilinear and equivariant. For instance, 
there is no semilinear and equivariant bijection between I? and Z 3 . 

We conclude this section with the following observations useful later: 

Lemma 3. Every orbit-finite subset of a polynomial set is semilinear. 
Corollary 1. Every equivariant function between single-orbit sets is semilinear. 

4 Semilinear automata and their decision problems 

Consider the integer atoms. A NFA is called semilinear if its transition relation 
is semilinear. 

Representation. In this section, we study semilinear automata and their de- 
cision problems. To speak of decision problems, we should first explain how a 
semilinear automaton is presented as the input to an algorithm. A finite rep- 
resentation of such automata is easily deducible from our knowledge collected 
by now. Basing on Lemma 1, we assume that every orbit is litcrrally and 
thus may be represented by the number k. An orbit-finite set is represented as a 
multiset of orbits. Transition relation of a semilinear automaton is represented 
separately for every monomial set. 

We start with the observation that we must restrict ourselves to equivariant 
automata only, as non-equivariant automata have undecidable emptiness, even 
in deterministic case: 

Theorem 1. Emptiness is undecidable for semilinear DFA with the integer atoms. 

Proof. Recall that every set is finitely supported, namely supported by every 
singleton subset of atoms. In consequence, the transition function 8 : Q x A — > Q 
of a semilinear automaton may be an arbitrary semilinear function. We claim 
that DFAs with semilinear transition function may simulate deterministic two- 
counter machines. 

Consider a deterministic two-counter machine with zero tests, which has n 
states. A configuration of this machine can be seen as an element of 

{0, ...,n- 1} x N x N. 

Denote by succ the function which maps a configuration to its successor. It is 
well known that the following question is undecidable: 

Given x and y, decide if there is some m such that y = succ m (x)l 

This question is undecidable even for a fixed machine, and of course also un- 
decidable when the machine is part of the input. Consider a Godcl coding of 
configurations as numbers defined by 

f(i,j,k)=n-(V -3 k ) + i. 
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Under this coding, the successor function is semilinar. More precisely, there is a 
scmilinear function g : Z — > Z such that for every configuration x = k), 

f(succ(x)) = g(j{x)). 

Therefore, it is undccidablc if there is some m such that g m {f{x)) = f(y). 

Define a semilincar DFA, with a singleton input alphabet, as follows. Its 
states are Z, the initial and accepting states are f(x) and f{y), and the transition 
function updates state n to g(n) ignoring input. The automaton is nonempty if 
and only if the two-counter machine halts. Thus emptiness of semilinear DFA is 
undecidable. □ 

Somehow surprisingly, equivariance of transition relation makes emptiness 
easily decidable: 

Theorem 2. Emptiness is decidable for semilinear equivariant NFA with the 
integer atoms. 

Proof. Consider an automaton, with input alphabet A, states Q, initial states 
ICQ, accepting states F, and transition relation S C Q x A x Q. Consider the 
sets 

Q d = I and Q n = f Q n -\ U 6(Q n -i, A) C Q for n > 1. 

It is easy to see that Q n is the set of states that can be reached after reading an 
input of length at most n, and that Q n can be computed based on Q n -\ using 
Presburger arithmetic. Since the set Q n is an equivariant subset of Q, there are 
finitely many possibilities for Q ni and therefore the chain Qq C Qi C • • • must 
stabilize at some point. If it stabilizes without containing an accepting state, the 
automaton is empty, otherwise the automaton is nonempty. □ 

The above theorem makes semilinear equivariant NFA look deceptively sim- 
ple. The following result illustrates that even in the deterministic case, semilinear 
equivariant automata are dangerously close to undccidability. We use below a 
term constant word for any word of the form x n , for some x£Z and n > 0. 

Theorem 3. The following problem is undecidable: 

- Input. A semilinear equivariant DFA with the integer atoms, with input 
alphabet Z. 

— Question. Does the automaton accept some constant word? 

Proof. By reduction of the halting problem for two-counter machines. We will 
use the same notation as in the proof of Theorem 1. In particular we will make 
use of the Godel encoding / of configurations of a two-counter machine with 
zero tests and of the semilinear function j:Z4Z that encodes the transition 
function of the machine. 
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Given a machine, define a semilinear equivariant automaton, with input al- 
phabet Z, as follows. Its states are 



ZU{e,T,±}, 

where e, T and _L have singleton orbits. The initial state is e, and the only 
accepting state is T. For the initial state, the transition function is defined by 

S(e, i) = f(x) + i G Z. 

For a state j £ Z, the transition function is defined by 



S(i,j) = 



t Hg(i-j) = f(y) 

.7 + .9(2 — i) otherwise 



The state T leads to _L on every input, and _L is a sink state: 

5(T,i) = ± 5(±,i) = ±. 

It is not difficult to see that S is equivariant, and that an input of the form 
m is accepted if and only if the two-counter machine goes from configuration 
x to configuration y in exactly m steps. An equivariant automaton accepts a 
constant word if and only if it accepts a word m for some to. This completes 
the proof. □ 

Corollary 2. It is undecidable if two semilinear equivariant DFA have nonempty 
intersection. 

Proof. The language of all constant words is easily seen to be recognized by a 
semilinear equivariant DFA. □ 

Observe how the delicacy of the decidability border between Theorem 2 and 
Theorem 3. Consider a semilinear equivariant DFA with states Q. For n G N, 
define P n C Q to be the set of states that can be reached after reading a constant 
word of length n. It is not difficult to see that P n is an equivariant subset of Q, 
and therefore there arc finitely many possibilities for P n . However, by Theorem 3, 
it is undecidable if there is some n such that P n contains an accepting state. 



5 Minimizing semilinear equivariant automata 

In this section we turn to the problem of minimizing DFAs. For any choice of 
atoms, in particular for the integer atoms, every DFA has an equivalent minimal 
DFA. We start by proving that when one starts with a semilinear equivariant 
DFA, the minimization operation stays in the realm of semilinear DFAs. 

Theorem 4. // a language L C A* is recognized by a semilinear equivariant 
DFA with the integer atoms then its minimal automaton is also a semilinear 
equivariant DFA. 
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Proof. Let A be a semilinear equivariant DFA recognizing L. Let Q be the state 
space of A and let P be the state space of the syntactic automaton, which is 
always equivariant. Let / : Q — > P be the mimimizing function, again always 
equivariant. By Corollary 1 we know that / is semilinear. Let S : Q x A — > Q be 
the transition function of the automaton A, which is semilinear by assumption. 
Our goal is to show that the transition function 7 : P x A — >• P of the syntactic 
automaton is semilinear. The function 7 is the same as the relation 

{(/(?), a, /(<*(?, a))) :?eQ>«e4 
which can be easily seen to be semilinear, assuming that <5 and / are. □ 

The theorem, however, says nothing about computing the minimal automa- 
ton. It even says nothing about deciding if an automaton is already minimal. 

Proposition 1. As far as decidability is concerned, deciding minimality is equiv- 
alent to computing the minimal automaton. 

Proof. Suppose that one can decide if an automaton is minimal. We show how 
to compute the minimal automaton. The algorithm runs two nested loops. 

— In the outer loop, we input a semilinear automaton A with states. First, we 
test if A is already minimal. If yes, the algorithm terminates and outputs A. 
Otherwise, the algorithm enters the inner loop. 

— In the inner loop, the algorithm searches through all equivariant functions 
/ : Q — > P, and tests each one to see if it is an automaton homomorphism, 
which can be expressed in Presburger arithmetic. Under the assumption that 
A is not minimal, the inner loop finds some homomorphism / which is not 
a bijection. Upon finding such a homomorphism, the inner loop terminates, 
and the outer loop is executed again. 

We claim that the outer loop can only be executed finitely many times. This is 
because the order 

X > Y if there is a surjective, non bijective, equivariant function / : X — > Y 

is well founded. The reason is that a non-bijective equivariant function must 
cither decrease the number of orbits, or decrease the characteristic of some orbit. 
By a characteristic of Zk we mean here the number k. Observe that the order 
docs admit arbitrarily long decreasing chains with the same starting point, e.g. 

Z > Z 2 k > Z 2 k-i > ■ ■ ■ > Z 2 > Zi for any k 6 N. 

□ 
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5.1 Partition refinement fails 



One natural approach to minimization problem would be to use a partition 
refinement algorithm, described below. 

Consider semilinear equivariant DFA A, with states Q. For n G N, define 
an equivalence relation ~„ on Q, which identifies two states if they accept the 
same inputs of length at most n. It is not difficult to sec that ~„ is a semilinear 
relation, because its definition can be expressed in Presburger arithmetic. Also, 
~„ is an equivariant equivalence relation on Q, which means that there is an 
orbit-finite quotient Q/~ n . Suppose that these equivalences stabilize at some 
n G N, which means that the equivalence relations ~„ and ~„+i are the same. 
Then it is not difficult to prove that ~„ is the Myhill-Nerode equivalence on 
states of the automaton, and the minimal automaton has states Q . 

Unfortunately, the equivalence might never stabilize: 

Proposition 2. There is a semilinear equivariant DFA, such that equivalences 
~„ never stabilize. 

Proposition 2 is shown using an automaton such that for every n G N, the set 
Q/~ n has three orbits, with characteristics 1, 1 and 2™ respectively. 

Proof. Consider an input alphabet 

A = {start, 0,1} x Z. 
Consider an automaton defined as follows. Its state space is: 

Q = ZW{e,J_}. 

The initial state is e. The orbit Z is accepting, while the orbits {e} and {_L} are 
rejecting. The transition function is defined below, for i, j G Z. When reading 
the definition below it is a good idea to look at the case j = 0. 

{i when a = start 
_L otherwise 

{ '~ J 2 ~' T + j when cr G {0, 1} and i — j — a is even 
_L otherwise 

_L 

Consider inputs to the automaton which are of the form 

startii) • cti(0) • • • cr n (0) for i G N and o\, . . . , cr„ G {0,1}. 

It is not difficult to see that such an input is accepted if and only if o~\ ■ ■ ■ a n is a 
prefix of the binary representation of i (the binary representation written with 
the least significant bit coming first). 

Consider the equivalence relation <~„ on Q defined by 

q^ n p if 5,p£Z and q = p mod 2". 



S ( e , (cm)) = 
(/M')) = 
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It is not difficult to see that for every a 6 A, we have 

g ~„ p implies S(q, a) ~„_i 5(p, a). 

It follows that states equivalent under ~„ have the same futures of length at 
most n. Also, the converse holds: if states are not equivalent under ~„, then 
they have different futures of length at most n. It follows that the equivalences 
~„ have more and more equivalence classes, as n grows, and never stabilize. □ 

5.2 Computing the minimal automaton 

So, how does one effectively minimize an automaton? Our approach is to re- 
duce the minimization problem to satisfiability for an extension of Presburgcr 
arithmetic, which allows a limited use of the divisibility predicate. 

A formula of existential Presburger arithmetic with divisibility (EPAD) is a 
formula of the form 

3x x ■ ■ ■ 3x n <j> (2) 

where the variables x\, . . . ,x n quantify over integers; and cf) is a quantifier-free 
formula in the language (Z, 0, 1,+, |), where | stands for divisibility. A typical 
instance of EPAD is deciding if there is a solution to the system 



3x 


= 3 


mod y 


5y 


= 7 


mod x 


2x 


= y 


- 18 



Theorem 5. [11] Satisfiability is decidable for EPAD. 

By a lengthy reduction to Theorem 5, we prove that one can effectively 
minimize a semilinear equivariant DFA. This is the main result of this paper. 

Theorem 6. Given a semilinear equivariant DFA with the integer atoms, one 
can compute the minimal automaton. 

The remaining part of the paper is devoted to the proof of the theorem. 
6 Proof of Theorem 6 

After some preparatory lemmas (Lemma 4 and Lemma 5 below) we formulate 
Theorem 7 below that essentially says that minimality is decidable for semilinear 
equivariant DFAs. By the virtue of Proposition 1 we know that this is sufficient 
for proving Theorem 6. 
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6.1 Decidability of minimality 

The following lemma shows that semilinear and equivariant subsets of monomial 
sets can be interpreted as semilinear, but not necessarily equivariant, subsets of 
monomial sets of lower dimensions. 

Lemma 4. The function 

XCZ k ^ {(z 2 ,...,i fe ):(0,i 2 ,...,i fc )eX}CZ fe - 1 

is a bijection between semilinear and equivariant subsets of Z k , and semilinear 
but not necessarily equivariant subsets ofZ k " 1 . 

Proof It is easy to see that the function produces semilinear sets, because the 
result of the function can be defined in Presburger arithmetic. The inverse of 
the function is defined by 

Fez*- 1 ^ {(Z,*i + i,...,i fc _i+0:(ii,...,*fc-i)ey} 

□ 

Consider a semilinear equivariant DFA A with states Q, input alphabet A, 
and transition function S. In the input alphabet, choose letters 

oi, . . . ,a n e A 

so that every orbit is represented. For each of these letters, consider the function 

Si : Q ->■ Q St(q) = S(q, a;). 

Lemma 5. An equivariant equivalence relation = on Q is a congruence in the 
automaton A if and only if it respects the final states and is a congruence for 
the functions S±, . . . , S n . 

Proof. Choose any states p,q <E Q such that p = q. We need to show that 

S(p, a) = S(q, a) for every a <E A. 

Choose then some letter a € A. By choice of a\, . . . , a n , there must be some a, 
and some permutation it such that a ■ ir = ai. By equivariance of S, we have 

S(q, a) ■ 7r = S(q ■ 7r, a ■ tt) = S{(q ■ tt) 

Because = is equivariant, it follows that p ■ ir = q • ir, and therefore by the 
assumption on =, we have 

Si(q ■ tt) = Si(p ■ 7r). 
By the same reasoning as above, we have 

Si(p ■ tt) = S(p,a) ■ tt. 

We have just proved that 

S(q, a) ■ tt = S(p, a) ■ tt. 
By equivariance of =, it follows that S(q, a) = S(p, a). □ 
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Observe that the functions 6i,...,5 n are not necessarily equivariant. Actu- 
ally, they can be completely aribtrary, because every choice of functions Si, . . . ,5 n 
can be lifted to an equivariant transition function 

5 : Q x A — >• Q defined by 5(q, a) = Si(q) ■ tt 

where cij and it G Z arc the unique elements satisfying a — a, ■ ir. Because we 
are dealing with a semilinear automaton, all that we know is that 5\,. . . ,S n are 
semilinear. That is why, in order to prove Theorem 6, we need to decide if there 
is an equivalence relation on Q which respects the final states, and which is a 
nontrivial congruence with respect to the arbitrary semilinear unary operations 
Si,...,S n . This is shown in Theorem 7, which is the main result of Section 6. 

Theorem 7. The following problem is decidable: 

- Input. An orbit finite set Q, an equivariant subset F C Q, semilinear but 
not necessarily equivariant functions Si, . . . , S n : Q — > Q. 

— Question. Is there an equivalence relation = on Q which: 

1. is nontrivial, i.e. identifies at least two different elements; 

2. respects F; 

3. is a congruence with respect to Si, . . . , S n ; 
4- is equivariant. 

The proof strategy for the theorem is as follows. 

1 . We show that every equivariant equivalence relation = on Q can be described 
by a signature, which consists of: 

(a) The equivalence type: an equivalence relation ~ on the orbits of Q. 

(b) The equivalence parameters: a finite vector of integer parameters. 
This step is done in Section 6.2. 

2. We show that for every choice of the equivalence type (there are finitely 
many choices), whether or not the equivalence relation = is a congruence 
with respect to Si,. . . ,S n can be described, in terms of the equivalence pa- 
rameters, by an existential formula of Presburger arithmetic with divisibility 
predicates. This step is done in Section 6.3. 

3. We recall Theorem 5, which says that satisfiability is decidable for existential 
formulas of Presburger arithmetic with divisibility predicates. 

The remaining part of Section 6 is devoted to the proof of Theorem 7. 

6.2 The signature of an equivariant equivalence relation 

In this section, we present the first step of the proof of Theorem 7. We show how 
every equivariant equivalence relation on an orbit finite set can be described by 
a finite piece of information and a vector of numbers. 

Consider an orbit-finite equivariant set Q, together with an equivariant equiv- 
alence relation =. Define a relation [=] on orbits of Q by: 

n [=] t 2 if qi = q2 for some qi G t\ and some q<i G r 2 . 

This relation [=] is called the equivalence type of =. 
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Lemma 6. The relation [=] is an equivalence relation. 

Proof. The only nontrivial part, transitivity, follows from equivariance, as shown 
below. Suppose that 

ti [=] t 2 and t 2 [=] t 3 

hold for three orbits n, t 2 , t 3 . This means that 

qi = g 2 and pi = p 2 for some gi G n,q 2 ,p 2 G r 2 ,p 2 G r 3 . 

Sincd q 2 and pi are in the same orbit, there must be some n such that pi-n = q 2 . 
By equivariance of =, we have pi ■ ir = p 2 ■ ir. Therefore, by transitivity of =, we 
have qi = p 2 ■ it, and therefore t\ [=] t 3 . □ 

We denote by Q/= the set of equivalence classes of Q under =. Consider the 
quotient mapping 

/ : Q -> Q/= 

which maps an clement of q to its equivalence class under =, denoted by [q] = . It 
is not difficult to see that / is an equivariant function. As an image of an orbit- 
finite set under an equivariant function, the quotient Q/= is orbit-finite. Also, 
it is not difficult to see that the orbits of Q /= are in one-to-one correspondence 
to equivalence classes of the equivalence type [=] . 

So far we have defined the equivalence type of =. This does not yet determine 
the equivalence relation =. The missing information is: what are the character- 
istics of the orbits in the quotient Q/=, and how does the quotient function 
identify elements of Q. 

— Let S be an equivalence class of [=], corresponding to an orbit of Q/=. 
Define char(Z') G N to be the unique number such that the orbit of Q/ = 
that corresponds to £ is isomorphic to Z c h ar3i (2;)- 

— Let S be an equivalence class of [=], and let r, a G S be orbits of Q. The 
images /(t) and f(a) are equal, and isomorphic to Z char4 ,(£). Let O^O^ be 
the copies of in the orbits t, a. Define 

diff (r, o) /(0 ff ) - /(0 T ) G Z char(l;) . 

In principle, diff (t,<t) is an element 3 of l-c^t^E)- However, it will be con- 
venient to treat it as an integer. That is why, we think of diff (t,ct) as any 
integer in the set 

diff (t, cr) + char(Z') • Z. 

If there will be a need to make differ, cr) unique, we can choose it to be the 
smallest positive integer in the set above. 

3 In the definition of diff(r, cr), we implicitly use some isomorphism between r L c \ iZI (s) 
and the orbit /(t) = f(u). There are, however, many possible isomorphisms. It 
can easily be checked that the difference /(0 CT ) — /(0 T ) docs not depend on the 
isomorphism. 
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Define signature of the equivariant equivalence relation = to be the triple 
<P= d = ([=], diff, char), 
which contains all the information defined above. This defines a mapping 

= i y $=. (3) 

We claim that the signature determines = uniquely. To prove this, we claim that 
there is an inverse transformation 

<P ^ =4, (4) 

which maps a signature to an equivariant congruence. 

Of course, the input to the inverse transformation needs to be consistent. 
This consistency condition is captured by the notion of an equivalence signature. 
An equivalence signature is a triple 

<P = diff, char) 

where ~ is an equivalence relation on orbits of Q and diff, char are vectors 

diff : {(r, a) G orbits(Q) x orbits(Q) : r ~ a} -> Z 
char : classes(^) — > N. 

such that the following consistency conditions hold for every equivalence class 
£ of ~: 

- for all n, r 2 , t 3 G S: 

diff(Ti,T 2 ) + diff(T 2 ,T 3 ) = diff(ri,T 3 ) mod char(Z'). 

- for all t e S, 

k = mod char(Z'), 
where t is isomorphic to Z^. 
The first condition above implies, in particular, that 

diff(r, t) = mod char(Z") and diff(r, a) = — diff((7, r) mod char(Z'). 

In the sequel we will write char(r) instead of char(I7), when r e S. Clearly, the 
mapping (3) is surjective onto equivalence signatures. 

The following proposition shows that an equivariant equivalence relation on Q 
is completely described by its signature. (Note however that there are in general 
many different signatures defining the same equivalence, because diff(r, a) is 
only determined modulo char(r) = char(cr).) 

Proposition 3. The exists a mapping (4) that maps equivalence signatures to 
equivariant equivalences, that is an inverse of the mapping (3), namely: 

=4>= = = for any equivariant equivalence = . 
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6.3 Which signatures describe congruences 

Proposition 4. Let ~ be an equivalence relation on orbits of Q, and let 

f :Q->Q 

be a semilinear function. There set 

Xf^ — {(diff, char) : (~, diff , char) is an equivalence signature and 
=(~,diff.char) is a nontrivial congruence for /} 

is definable in EPAD. 

Before proving Proposition 4, we show how it implies Theorem 7. 

Proof (of Theorem 7). Recall that Theorem 7 says that one can decide if there 
exists some nontrivial equi variant equivalence relation on Q, which respects F C 
Q, and which is a congruence with respect to semilinear operations 5\,. . . ,S n . 
By Proposition 3, every equivariant equivalence relation on Q is of the form =<p, 
for some equivalence signature 

<P = (~, diff, char). 

Since the set F is equivariant, it is a union of orbits F = n U • ■ • U Tfc. This 
means that =<z> respects F if and only if the equivalence relation ~ respects the 
set {n, . . . , Tfc}. Recall the sets Xf t ~ from Proposition 4. There is a nontrivial 
congruence on Q which respects F if and only if the following set is nonempty 

u n 

~ respects Ti ..... T k ie{l,....n} 

As EPAD admits positive boolean combinations, it follows from Proposition 4 
that the above set is definable in EPAD. We can therefore invoke Theorem 5 to 
test if the set is nonempty. □ 

The rest of Section 6.3 is devoted to proving Proposition 4. Fix the relation 
~ on orbits of Q for the rest of this section. 

We will identify an element of Q with a pair (r, i), where r is an orbit and i 
is an integer. When the orbit r has characteristic k > 0, then this representation 
is many-to-one, because the pairs (r, i) and (r, i + k) represent the same element. 

Lemma 7. Let <L> = (~, diff , char) be an equivalence signature. The equivalence 
=<p is a congruence with respect to a function f if and only if 

V« G Z /(t, i) =<p /(t, i + char(r)) for every orbit t (5) 

Vi £ Z /(r, i) =<p /(cr, « + diff (r, ct)) /or every or&iis r ~ a (6) 
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Proof. The left-to-right implication is immediate. We only do the right-to-lcft 
implication. A short argument is that the relation =<p is the smallest equivalence 
relation generated by the pairs 

(t, i) =<j (t, i + char(r)) (r, i) =<j> (cr, i + diff (r, a)) 

ranging over integers i E Z and orbits r ~ <r. Therefore, if / respects the 
generators, it will also respect the whole equivalence relation. □ 

The key technical lemma is Lemma 8, stated below. 
Lemma 8. Fix f , ~ and orbits r,a. The set 

*/,~,r,a - {(diff, char, A) : Vz G Z /(r, i) = Kdiff , char) /(a, i + A)} (7) 

is definable in EPAD. 

Before proving the lemma, we show how, together with Lemma 7, it implies 
Proposition 4. 

Proof (of Proposition 4). In order to prove Proposition 4, it is sufficient to es- 
tablish two conditions. First, we claim that for every choice of ~, the set 

{(diff, char) : (~, diff, char) is an equivalence signature} 

is definable in EPAD. Then, by Lemma 7, it suffices to show that for every choice 
of ~, each of the properties (5), (6) can be defined in EPAD. This is exactly what 
Lemma 8 says. □ 

The key obstacle for expressing (7) in EPAD is that it has the universal 
quantifier Vi, which is not allowed in EPAD. The is idea to use a technique of 
quantifier elimination, which is presented in the following lemma about arith- 
metic progressions. A finite arithmetic progression is a set a + p ■ {0, . . . , k}. An 
infinite arithmetic progression is a set a + p ■ N. Both the base a and the period 
p can be negative. 

Lemma 9. Fix an equivalence signature <P = (<~, diff, char). Let f\,fa : Z — > Q 
be affine functions defined by 

fi(x) = (n, oi -x + bi) f 2 (x) = (r 2 , a 2 ■ x + 6 2 ), 

for orbits n ~ t 2 and coefficients 01,61,02,62 S Z. Let X C Z be a finite or 
infinite arithmetic progression. For any two consecutive elements Xq,X\ E X, 

yxEX h{x)=*f 2 {x) iff f\ h{x)=^f 2 {x) 

i£{io^i} 

Proof. We only prove the nontrivial right-to-left implication. By unraveling the 
definition of functions /1 and f 2 , fi(x) =<j> fi{x) is equivalent to 

a(x) = mod char(n) 
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for the function a : Z — >• Z denned by 

a(x) = a\ ■ x + bi — diff (n, T2) — 02 • x — 62- 
This is an affine function. Suppose then that 

y\ a(x) = mod char(n) 

xd{x a ,x 1 } 

holds. Then it follows that 

Vfc G Z a(x + k ■ (x\ — xo)) = mod char(n). 

In particular, because x\ and xq are consecutive elements in the progression X, 
it follows that 

Viel a(x) = mod char(n), 

which means that 

VxeX h{x) f 2 {x) 

as required. □ 

Proof (of Lemma 8). Consider the functions g,h : Z — »• Q defined by 

g( x ) = f{T,x) and h(x) = f{<J,x). 

As / is scmilinear, we know that there is a partition of Z into a finite family of 
arithmetic progressions {Xi} i£ i (respectively, {Yj}j e j), and a family of affine 
functions {gi : Z — > Q}i£i (respectively, {hj : Z — > Q}jej), such that on argu- 
ments from Xi, the functions gi and g coincide (respectively, on arguments from 
Yj, the functions hj and h coincide). 

Expressed in terms of these partitions and affine functions, condition (7) from 
Lemma 8 becomes 

Vx f\ (x G Xi A (x + A G Yj)) => g l (x) hj(x + A) 

ieijeJ 

(we write shortly <P instead of (<~, diff , char) here and in the sequel) which is 
the same as the conjunction, ranging over all choices of i G / and j G J, of the 
properties 

Vx G (X t n (Yj- - Zi)) gi(x) =$ hj(x + A) (8) 

Therefore, it is sufficient to provide a formula of EPAD for each formula of the 
form (8). Let us fix then i and j in the sequel. 

As an intersection of two arithmetic progressions, the set Xi n (Yj — A) is 
an arithmetic progression. The sets Xi and Yj are known, only A is unknown. 
Depending on the value of A, the progression Xi n (Yj — A) might have zero, one 
or at least two elements. Thanks to Lemma 9, we know that (8) is equivalent to 
the following property: 
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(Al) If Xi n (Yj — A) has at least one element x, then 

gi{x) =4, hj(x + A). 

(A2) If Xi n (Yj — A) has at least two elements x\,x 2 , and they are consecutive, 
then 

f\ 9i(xk)=^hj(x k + A). 
fee{i,2} 

We claim that both these properties can be formalized in EPAD. Let us focus 
on the more difficult property (A2). It is not difficult to see that the set 

cons = {(A, xi, X2) ■ Xi,X2 are consecutive elements of Xi n (Yj — A)} 

are Presburger definable. Property (A2) is an implication. The head of the im- 
plication, "if Xi n (Yj — A) has at least two elements £1,2:2" is defined by a 
Presburger formula 3 x\x 2 cons(A,x\,X2)- The tail of the implication can be 
seen as the formula: 

3 xix 2 cons(A,x 1 ,x 2 ) A f\ gi(x k ) =4, h (x k + A), 

fe€{l,2} 

which is definable in EPAD. This completes the proof of Lemma 8, being the 
last part of the proof of Theorem 7. □ 
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